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Algebraic Recipes for Integer Programming 

Bernd Sturmfels 



Abstract. Integer programming is concerned with solving linear systems of 
equations over the non-negative integers. The basic question is to find a solu- 
tion which minimizes a given linear objective function for a fixed right hand 
side. Here we also consider parametric versions where the objective function 
and the right hand side are allowed to vary. The main emphasis is on Grobner 
bases, rational generating functions, and how to use existing software packages. 
Concrete applications to problems in statistical modeling will be presented. 



1. An Introductory Coin Problem 



This lecture is about solving linear equations over the non-negative integers. 

■ Our point of departure is the integer programming problem in standard form: 

(1.1) Minimize c • u subject to A ■ u — b and u £ N™. 

The given instance consists of an integer matrix A £ Z dxn , a row vector c£U l and 
Cf~) . a column vector b £ II 1 . The unknown is the column vector u = (iti, . . . , u n ). What 

makes the problem hard is the requirement that the it, be non-negative integers. 

■ As an example consider the following simple coin problem. Suppose you are 
carrying a large collection of coins in your pocket. The allowed coins are pennies (1 

Scent), nickels (5 cents), dimes (10 cents) and quarters (25 cents). The problem is 
to replace your "portfolio" by an equal number of coins having the same monetary 
^ . value, but such that the number of nickels plus the number of quarters is minimized. 

' This problem can be expressed in the standard form (|1.1|) by setting 

(1.2) d = 2,n = 4, A= (\ 5 10 25) and c= (0,1,0,1). 

The right hand side vector b = (^) is left unspecified. Its coordinates b\ and 62 
are the number of coins and value (in pennies) of the portfolio respectively. 

Example 1.1. For instance, if b = ( 114 ), then we seek to express one dollar 
and fourteen cents with ten coins. The optimal solution to this instance of (|1.1|) 
is u = (4,2,0,4) and the optimal value is c • u = 6. In words, you can take four 
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pennies, two nickels and four quarters to make one dollar and fourteen cents with 
ten coins, but it is impossible to do it with less than six nickels or quarters. □ 

A parametric solution to our problem is provided by the Grobner basis 

(1.3) G = { n 3 q-d 4 , n 6 ~p 5 q, n 3 d 4 ~p 5 q 2 , p 5 q 3 - d s }. 

Our Grobner basis in (|1.3f) is expressed as a set of monomial differences, which 
is how they usually appear in computer algebra systems. We note that there are 
two alternative but entirely equivalent ways of writing our Grobner basis. In the 
optimization literature, it is more common to express g as a set of lattice vectors; 

(1.4) g' = { (0,3,-4,1), (-5,6,0,-1), (-5,3,4,-2), (5,0,-8,3) }. 

The Grobner basis is a set of exchange rules which you can use to successively 
improve your portfolio. For instance, the first rule n 3 q — d 4 says that you can 
replace three nickels and one quarter with four dimes. Each of the four moves in g 
changes neither the number of coins nor their value but it decreases the objective 
function value. The crucial property of being a Grobner basis says that if none of 
the exchange rules can be applied then your portfolio is guaranteed to be optimal. 

There is a third way of encoding the Grobner basis, which will be of importance 
in Section 4. Namely, we can also express g as the following generating function: 

(1.5) g" = n 3 D 4 q + P 5 n 6 Q + P 5 n 3 d 4 Q 2 + p 5 D s q 3 . 

In the last representation there are two variables for each column of the matrix A, 
and each monomial represents one exchange rule. The lower case variable represents 
the gain and the upper case variable represents the loss in the exchange of coins. 

In Section 2 we explain how the Grobner basis is constructed for an arbitrary 
matrix A and cost function c, and in Section 3 we discuss the relationship to other 
notions of test sets in integer programming, including Hilbert bases and Graver 
bases, and we introduce Hemmecke's easy-to-use software 4ti2 for computing 
these test sets. In Section 4 we address complexity issues. In particular, we show 
how the Grobner basis can computed in polynomial time when d and n are fixed. 

The power of algebraic methods in integer programming stems from the fact 
that they can answer parametric questions like: What are all the optimal portfolios 
in our coin problem ? Each portfolio is given as a vector u = (u\, U2, W3, 114,) or as 
a monomial p Ul n U2 d U3 q Ui , and we wish to encode all portfolios that are optimal 
solutions of ll.lj l with b = Au. The following three conditions are equivalent: 

(a) The vector u £ N 4 is an optimal portfolio. 

(b) None of the four monomials n 3 q, n e , n 3 d A or p 5 q 3 divides p Ul n U2 d U3 q Ui . 

(c) (u2 < 2 or U4 — 0) and (u 2 < 5) and (112 < 2 or u 3 < 3) and {u\ < 4 or u 4 < 2) . 

The Hilbert series of all optimal solutions is the formal sum of these monomials: 

p Ul n U2 d U3 q U4 

u optimal 

This generating function is equal to the following rational function: 

n 3 q 3 p 5 - n 6 qd 4 + n 6 d 4 + n 3 qd A - q 3 p 5 + n 6 q - n 3 d 4 - n 6 - n 3 q + 1 
(1 - p)(l - n)(l - d)(l - q) ■ 

In Section 4, we will see that such Hilbert series can be computed in polynomial 
time (for fixed d and n) . In Sections 5 and 6 we will focus on applications of integer 
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programming to statistics, and we will argue that Grobner bases and generating 
functions are useful tools for the optimization problems arising in this context. 

The use of Grobner bases as a tool for integer programming first appeared in 
the paper by Conti and Traverso. Their approach was further developed in 
two doctoral dissertations in the Cornell Operations Research Department, writ- 
ten by Thomas (see |Thoj ) and Ho§ten (see |HS1| ) in 1994 and 1997 respectively. 
Subsequently, Hossten and Thomas |HT1| developed an algebraic theory of group 
relaxations, extending the foundational work in integer programming theory which 
was done by Gomory in the 1960's. These and many other important topics will 
not be discussed in this lecture, which aims to be introductory and self-contained. 
Readers wishing to learn more about commutative algebra methods in integer pro- 
gramming are referred to the book |Stuj and the survey articles |HT2j and |Tho2j . 

2. Grobner Bases 

We consider the integer programming problem in standard form (ll.l|) where A 
and c are fixed and b is arbitrary. In this section we further assume that c is generic 
in the sense that (| 1 . 1 1) has a unique optimal solution for every feasible right hand 
side b. In practise, this can always be accomplished by lexicographically perturbing 
the given cost vector c. Consider the infinite set of all optimal solutions, 

Opt^ c = { u G N n : u is the optimal solution of i|l.l|) for b = Au } . 

Suppose that u and u' are vectors in N™ such that u' < u (coordinatewise) and 
u G Opt^ c . Then it can be seen that u' G Opt^ c . We paraphrase this observation 
in the following lemma, using the language of partially ordered sets (= posets). 

Lemma 2.1. The set Opt^ c is an order ideal in the partially ordered set W 1 . 

A basic result about order ideals in the poset N™, known as Dickson's Lemma, 
states that the set of minimal elements in the complementary set Opt^ C \N" is 
finite. We write Min(N™\Opt j4 A for this finite set. Its elements are called the 
minimally non-optimal points of the integer programming family l|l.ll) . Recall that 
our introductory coin example had precisely four minimally non-optimal points: 

Min(N 4 \Opt Ac ) = {(0,3,0,1), (0,6,0,0), (0,3,4,0), (5,0,0,3) }. 

For every g + G Min(N n \Opt^ c ) there exists a unique vector g~ G Opt A c 
such that Ag + = Ag~ . Namely, g~ is the optimal solution to (jl.lfl with b = Ag~ . 

Definition 2.2. The Grobner basis for the matrix A and cost vector c is 

Ga, c = {g + -g- : g + e Min(N"\Opt Ac ) }. 

This is a finite set of lattice vectors in the kernel of A. We can also regard them as 
monomial differences in n unknowns x\ or as monomials in 2n unknowns Xi,yi via 

(2,13,0,-8,5,-7) < > xlx^Xfr — x^Xg < ► x\x^y\x\yl. 

The following theorem states that the Grobner basis is a minimal test set for 
the family of integer programs specified by the matrix A and the cost vector c. 

Theorem 2.3. Let u be a feasible solution of Then u is non-optimal if 

and only if there exists g G Qa,c with g + < u, and in this case u — g is a better 
feasible solution than u. There is no smaller set than Qa,c which has this property. 
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Proof. By construction, every element g = g + — g~ satisfies c • g = c • g + — c ■ 
g~ > 0. The if-direction follows because (u — g) ■ c < u- c and g + < u is equivalent 
to it — g being feasible. For the only-if direction suppose that no g £ Ga,c satisfies 
g + < u. This means that no element of Min(N Tl \Opt j4 c ) lies below u in the poset 
N n . But this means that u £ Opt A c . The minimality of Ga.c holds because every 
element of Min(N n \Opt j4 c ) has to be reducible by some vector in the test set. □ 

Under a certain genericity hypothesis on the matrix A, the elements in the 
Grobner basis are in bijection with the neighbors of the origin, which is a test set for 
integer programming introduced by Herbert Scarf |Sca| . The connection between 
neighbors and Grobner bases was studied in a commutative algebra setting in |PS| . 

Let us assume that the Grobner basis Ga,c is known to us in some explicit 
or implicit form. If we are given any feasible solution u £ W 1 then the integer 
programming problem 1)1.1(1 can be solved by the following one-line algorithm: 

(2.1) While there exists g £ Ga,c with g + < u do replace u by u — g. 

The problem of constructing a first feasible solution u from the right hand side b 
can be solved by the same reduction process but for a different Grobner basis. The 
idea is completely analogous to Phase One in the Simplex Algorithm. To keep our 
discussion simple, we will assume that some feasible solution u is known beforehand. 

One of the objectives of this lecture is to dispel the belief, held by many experts 
in complexity theory and combinatorial optimization, that the algebraic notion of 
Grobner bases is utterly useless when it comes to designing efficient algorithms. 
Let me begin by pointing out that computing Grobner bases is easy and fun. 

My currently favorite tool for producing the Grobner basis Ga.c from the ma- 
trix A and the cost vector c is the software 4ti2 developed by Raymond Hcm- 
mecke. It can be found at www.4ti2.de and is ridiculously easy to download and 
run. It took me (= a technologically challenged individual) precisely three minutes 
to install 4ti2 on my (ancient) computer, and another minute later I was already 
enjoying my first Grobner basis on the screen. Actually, I don't recall having ever 
encountered a piece of mathematical software that was simpler to use than 4ti2. 

The first non-coin example I tried had d = 3 and n — 7. The matrix A was 
filled snakewise by prime numbers and the vector c was filled by square integers. 
The input to 4ti2 consists of a matrix in a file named example in the format 
7 3 

2 3 5 7 11 13 17 
43 41 37 31 29 23 19 
47 53 59 61 67 71 73 

and a cost vector in a file example . cost in the format 
7 1 

1 4 9 16 25 36 49 

After typing groebner example and hitting the return key, about three seconds 
later, the Grobner basis appeared in a new file called example .gro: 

7 241 

-6 14 -11 2 1 
-6 13 -9 2 -1 1 
0-120-1-11 
-4 11 -7 -1-2 3 
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10 -3 -35 38 -1 -7 
22 -53 39 -2 -4 
4 11 -46 40 -7 



This Grobner basis consists of 241 vectors in Z 7 , and it represents a solution to the 
parametric problem of minimizing Y^l=i * 2 ' u i over all vectors heN 7 that satisfy 

I 2 3 5 7 11 13 17\ 

(2.2) 43 41 37 31 29 23 19 • u = b. 
\47 53 59 61 67 71 73/ 

Knowing the 241 vectors in the Grobner basis, we can now apply the reduction 
algorithm (|2.1() starting with any given feasible solution u. Take, for instance, 
u = (100, 100, 100, 100, 100, 100) T . The corresponding right hand side is b = A ■ 
u = (5800,22300,43100). The algorithm in lf2~T) reduces u to the optimal solution 
u* = (62, 8, 176, 17, 423, 0, 0). The optimal value is found to be c* = 12, 525. 

Knowledge of the Grobner basis allows us to answer more advanced structural 
questions about the system l|2.2[l . One such question is that of finding the integer 
programming gap, a topic to be discussed in Section 4. Another example is the 
question of sensitivity analysis with respect to the cost function. Suppose that the 
cost vector is allowed to vary in a neighborhood of the given vector c. Then the 
Grobner basis Q = Ga,c remains unchanged provided c ranges in the Grobner cone, 
which is defined by the following linear inequalities in the unknowns ci, . . . , c n : 

(2.3) c • g > for all g G Q. 

For instance, in our coin example, the Grobner cone is the set of all solutions to 

3c2 + C4 > 3c3 , 6C2 > 5ci + C4 , 3c2 + 4c3 > 5ci + 2c4 , 5ci + 3c4 > 8C3. 

The collection of all Grobner cones in R" forms the Grobner fan of the matrix 
A. This is an important invariant which allows us study how the solution of (|1.1|) 
changes as both b and c are allowed to vary. See |STI for the basic theory. 

The Grobner fan of a matrix A can be efficiently calculated using the algorithm 
of Huber and Thomas [HT . A highly efficient implementation was recently given 
by Anders Jensen in his program CaTS. This piece of software can currently be 
found at the web page http://www.soopadoopa.dk/anders/cats/cats.html 

3. Hilbert bases and Graver bases 

Grobner bases are closely related to other natural notions of test sets arising 
in the theory of integer programming. A classical such notion is that of a Hilbert 
basis. Consider the problem of solving a homogeneous system of linear equations 
over the non-negative integers. As before, we assume that the defining matrix A 
has d rows and n columns. Then our solution set is the following semigroup: 

(3.1) ker N (A) = { u E N" : A-u = 0}. 
Consider the subset of non-zero minimal elements of the semigroup: 

(3.2) Ha = { u 6 kerN(A)\{0} : no element v £ kerN(A)\{0, u} satisfies?; < u}. 
The following result is due to the 19th century invariant theorist Paul Gordan: 

Proposition 3.1. The set Ha *s finite. It is the unique minimal set such that 
every vector in kerpj(^4) is an N -linear combination of elements in Ha- 
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The finite set Ha is called the Hilbert basis of the matrix A. Hilbert bases can 
also be computed using the program 4ti2. We consider the same matrix as in (|2.2|l 
but we now alternate the sign pattern of its columns in the input file example: 
7 3 

2-3 5-7 11 -13 17 
43 -41 37 -31 29 -23 19 
47 -53 59 -61 67 -71 73 

After typing hilbert example and hitting the return key, about six seconds later, 
the Hilbert basis appears on a new file called example .hil: 
7 1305 

4 34 62 38 3 1 
4 35 64 38 2 1 2 
4 60 123 77 1 5 
4 36 66 38 1 2 3 



673 980 2 647 324 
674 982 1 648 325 
675 984 649 326 

The Hilbert basis consists of 1,305 vectors, and it has a lot of internal structure. 

Hilbert bases play an important role in the recent work of Robert Weismantel 
and his collaborators on "primal methods in integer programming" . The paper 
HKW introduces the notion of integral basis which is a slight generalization of 
Hilbert bases, and it presents a simplex-like integral basis algorithm which is shown 
to perform very well on standard benchmark problems in integer programming. 

A larger test set associated with an integer matrix A is the Graver basis, which 
can be defined as follows. For any sign pattern a G {— 1, +1}" let D a be the 
n x n-diagonal matrix with i-th entry o~i. The Graver basis of A is the finite set 

(3.3) gn A := (J D a -n ADa 

o-e{-i,+i}" 

In this definition, we are taking the union over the 2 n Hilbert bases for the various 
matrices A ■ D CT . The signs are adjusted so that each Hilbert basis lies in the kernel 
of the original matrix A. Proposition 13. II ensures that the Graver basis QTZ a is a 
finite subset of kerz(A). The following result is proved in |Stul §7]. 

Proposition 3.2. The Graver basis is a universal Grobner basis. It contains, 
up to negating vectors, the Grobner bases of A for all cost functions. In symbols, 

(3.4) U g A , c c gn A . 

The Graver basis is the ultimate test set one can compute for a given integer 
matrix A. It provides a parametric solution to the integer programming problem 
(|1.1|) when both the right hand side b and the cost function c are allowed to vary. 

Example 3.3. If A is the 2 x 4- matrix l|1.2fl in our coin problem, then 

gn A = { (0,3,-4,1), (-5,6,0,1), (-5,3,4,-2), (5, 0, -8, 3) (-5, 9, -4, 0)}. 

This Graver basis has only one more element than the Grobner basis 1)1. 4p . The 
advantage of the Graver basis over the Grobner basis is that we can now use (|2.1|) 
to solve the coin problem with respect to an arbitrary cost vector c. □ 
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The Graver basis has another natural interpretation in integer programming. 
Consider our original problem but now add the requirement that the coordi- 
nates Ui of the solution u are bounded above by some quantities a^. 

(3.5) Minimize c • u subject to A ■ u — b , u G N" , and u < a. 

Here we regard A and c as fixed and (a, b) G Z n+d as unspecified. It turns out that 
the Graver basis is the unique minimal test set for this family of integer programs. 

Theorem 3.4. Let u be a feasible solution of ^.y..5|) . Then u is non-optimal if 
and only if there exists g G QTZ a with g + < u and g~ < a — u, and in this case 
u — g improves u. There is no smaller set than QTZa which has this property. 

PROOF. We must prove the only if direction. Suppose u is non-optimal for 
(|3.5|l and let v be the corresponding optimal solution. Pick a G { — 1, +1}™ so that 
D a (v — u) is a nonnegative vector. There exist elements hi, . . . , h r in the Hilbert 
basis Hd^a such that D a (v —u) — hi + • • ■ + h r . and hence 

v — u = D a hi + • ■ • + D a h r , 

where each summand lies in QTZa- Since c-(v—u) < 0, there exists at least one index 
i such that D a • hi ■ c < 0. The vector g = —D a ■ hi lies in QTZa- The construction 
implies that it satisfies g + < u and g~ = (T) a ■ hi) + < (v — u) + < a — u. 

We now show that every element g = g + — g~ of QTZa is needed in a test set for 
our problem. Suppose that c ■ g < and define a = g + + g~ and b = Ag + = Ag~ . 
With these choices of a and b, the vectors g + and g~ are the only two feasible 
solutions for (j3.5|l . Hence the move from g + to g~ must be in the test set. □ 

In light of Proposition 13.21 and Theorem 13.41 it is highly desirable to be able 
to precompute the Graver basis of a given integer matrix. An algorithm for this 
computation is available in 4ti2. But the reader should be warned that Example 
13. 31 is somewhat misleading: the Graver basis is often much larger than the Grobner 
basis and it takes much longer to compute it. Consider again our example matrix, 
7 3 

2 3 5 7 11 13 17 
43 41 37 31 29 23 19 
47 53 59 61 67 71 73 

The command graver example produces the Graver basis in a file example . gra: 

7 29417 

12 0-111 

24 57 39 -2 2 

25 59 39 -1 -1 3 



14 -9 10 81 -6 -89 -37 
64 86 11 1 -6 -48 -28 
114 229 126 -1 -6 -11 -15 
124 268 161 -7 -6 10 -6 



This Graver basis has 29, 417 elements and it took a couple of hours to compute. 
One nice feature of the Graver basis computation in 4ti2 is that the program allows 
the exploitation of symmetry. In many applications (e.g. in statistics) there is a 
group of symmetries acting on the columns of the matrix A, and the Graver basis 
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QTi-A is invariant under these symmetries. This feature allows the computation of 
some interesting Graver bases whose cardinalities are in the range of one million. 

4. The integer programming gap 

A commonly used first step towards solving a hard integer programming prob- 
lem ((1.1(1 is to begin by solving its linear programming relaxation: 

(4.1) Minimize c • u subject to A ■ u — b and u G R> . 

Linear programming problems are much easier both in practise and in theory. They 
can be solved in polynomial time using interior-point methods, and the simplex 
algorithm performs well in practise. The purpose of this section is to offer algebraic 
tools for comparing the hard problem 1)1.1(1 with the easier problem (14.1(1 . For an 
algebraic perspective on the linear programming relaxtion see [HT1 . 

As before, we fix A G Z rfx " and c£Z" and regard b G Z d as unspecified. We 
write IPopt^ c (b) for the optimal value of the integer program 1(1.1(1 and we write 
LPopt^ c (b) for the optimal value of the corresponding linear program ((4.1|) . The 
difference of these quantities is a non-negative rational number 

(4.2) IPopt AiC (6) - LPopt AjC (6) > 0. 

The integer programming gap is defined as the maximum of the differences (14.2(1 as 
b ranges over all right hand sides such that p. 1(1 is feasible: 

(4.3) gap(A,c) = max{lPopt A c (6) - LPopt A c (6) : b € Z d feasible for |TTJ } . 

It appears as if we are taking the maximum over infinitely many different values, 
one for each feasible b, but actually there are only finitely many possible values for 
((4.2(1 if A and c are fixed, so the maximum is attained. 

Example 4.1. The integer programming gap of the coin problem ((1.2|) equals 

gap(Ac) = 76/15 = 5.0666666... 

This is the maximum advantage to be gained if we allow our coins to be cut into frac- 
tional pieces. The gap is attained for the right hand side b = ( 114 ) in Example ll.il 
where IPopt^ c (b) = 6 is realized by u = (4, 2, 0, 4). The optimal value of the linear 
program gj) is 14/15 = 0.93333... and is attained by v! = (0,0,136/15,14/15). 
Thus the best way to make one dollar and fourteen cents with ten fractional coins 
is to take 136/15 dimes and 14/15 quarters. □ 

We now give a recipe for computing the gap by solving several auxiliary linear 
programming problems. For any optimal vector u G N n we define the increase set 

incr(w) := { i G {1, 2, . . . , n} : u + optimal }. 

A vector u G W 1 is said to be maximally optimal for ((1.1(1 if u + a is optimal for all 
vectors a G W 1 whose support {i : a. L > 0} is a subset of the increase set incr(u). 
For any fixed maximally optimal u G N™ , we consider the following linear program: 

(4.4) Maximize c-(u — v) subject to A-(u — v) = and Vi > for all i €" incr(it). 

Here the decision variables are the coordinates of v = (vi, . . . , v n ). 

Theorem 4.2. (Ho§ten and Sturmfels HS2 ) The maximum of the optimal 
values of the auxiliary linear programs i4-4V , as u ranges over all maximally optimal 
solutions to coincides with the integer programming gap, gap(^4, c). 
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Example 4.3. Our coin problem 1(1.2(1 has three maximally optimal solutions: 

(4.5) (4,2,0,0), (0,2,0,2), (0,5,3,0). 

In each case the increase set incr(it) is indicated by the underlined coordinates. 
The vectors in 1)4.5(1 are easily derived from the Grobner basis ((1.3(1 . For instance, 
the last portfolio (consisting of five nickels and three dimes) is maximally optimal 
because adding one nickel, dime or quarter makes that portfolio non-optimal but 
adding any number of pennies is fine. The program 1(4.4(1 for that portfolio equals 

Maximize 4 — V2 — subject to v\ + v 2 + U3 + V4 = 4 
V\ + 5v2 + 10t>3 + 25t>4 = 60 , t>2 > and V4 > 0. 

The optimal value is 4. The optimal values of ((4.4(1 for (4, 2,0,0) and (0, 2, 0, 2) are 
76/15 and 5 respectively, and hence gap(A, c) = max{4, 5, 76/15} = 76/15. □ 

Theorem 14.21 furnishes an algorithm for computing gap(A, c) because the set 
of maximally optimal solutions to ((1.11) is always finite and can be computed from 
the Grobner basis Ga,c by the algebraic process of irreducible decomposition of 
monomial ideals. A highly efficient implementation of this process was developed 
by Alex Milowski in his Master's thesis project at San Francisco State University. 
The non-trivial gap computations in Examples 14.41 and 16.31 were done by Ho§ten 
and Milowski using 4ti2 (to derive the Grobner basis), Milowski's software (to get 
the maximally optimal solutions) and maple (to solve the linear programs I(4.4|) l 

Example 4.4. Let d = 3, n = 7 and consider the instance discussed in 12.2(1 : 

/ 2 3 5 7 11 13 17\ 

A = 43 41 37 31 29 23 19 , c = (1, 4, 9, 16, 25, 36, 49). 
\47 53 59 61 67 71 73/ 

There are 553 maximally optimal solutions, and the gap is 

(4.6) gap(Ac) = 43771/183 = 239.1857923 

The gap is attained by the right hand side b = (661, 1710, 3994) T . For this choice 
of b, the optimal value of 1(1. 1() equals 1,757 and is given by the optimal solution 
u = (7, 4, 0, 22, 0, 3, 26), while the optimal value of ((4.1|) is a little less than 1, 518 and 
is given by the optimal solution u = (0, 0, 0, 0, 14029/244, 463/366, 521/732). □ 

5. Short rational generating functions 

The importance of rational generating functions for lattice point problems has 
been known to combinatorialists for a long time. Their role as an efficient tool in 
integer programming, however, has been recognized only quite recently, in response 
to the polynomial time algorithms of Barvinok Barj and Barvinok- Woods BW . 
This work was further extended by De Loera et.al. [DliHHSY , DHHHY]. This 
section reports on these methods and their implementations in the software LattE. 

As a point of entry consider the following variant of our problem: List all 
optimal solutions to the integer program ((1.1(1 . For a concrete example take d = 
1, n = 4, A = ( 1 1 1 1 ), c = (0, 0, 0, 1), and suppose b > 0. Here (|LT|) equals 

(5.1) Minimize M4 subject to u\ + U2 + 1*3 + U4 = b and ui, 112, U3, U4 G N. 
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The set of optimal solutions is the set of all lattice points (m, U2, U3, 0) in a large 
triangle. We can write them all down as the terms of the generating function 

b b—u± 

2-^i 1 2 3 4 — 2-^i 2-^i 1 2 3 

u optimal for J5.H «i=0« 2 =0 

The number of terms in this series equals (b + l)(b + 2)/2 = 0(b 2 ). This quantity 
is exponential in the size of the input, which is 0(log(6)). Indeed, the number of 
bits needed to write down the line (|5.1|) grows like the logarithm of the integer b, 
while the number of terms on the right hand side of l|5.2|l is exponential in log(6). 
It appears to be impossible to "list" all feasible solutions to l|5.1(l in polynomial 
time, given that their number grows exponentially in the input size. Nonetheless, 
it can be done, namely, by rewriting 1)5.2(1 as the short rational generating function 

^■(i--r 1 (i--r 1 + ^■(i-V(i--r 1 + ^--r^--)- 1 - 

Xl Xl x 2 x 2 x 3 x 3 

The reader is invited to check that this rational function equals the series 1(5.211 . The 
rational function can be computed in time 0(log(6)) and it represents the "list" 
of all optimal solutions to ((5.1(1 . This approach works for any integer program: 

Theorem 5.1. Suppose that d and n are fixed. Then the number of optimal so- 
lutions to ((1.1(1 and the rational generating function x u : u optimal for ((_?._? (1 }, 
which encodes the set of optimal solutions, can be computed in polynomial time. 

Proof. The optimal value c* of 1(1.1(1 can be computed in polynomial time 
using Lenstra's algorithm (Lenj . Now apply Barvinok's lattice point algorithm 
|Bar| to the polytope { u s R" : Au — b, c ■ u — c* }. It computes the desired 
generating function and its evaluation at (1,1,..., 1) in polynomial time. □ 

The techniques underlying Barvinok's algorithm were developed substantially 
further by Barvinok and Woods BW . Using their Projection Theorem, one can 
derive polynomial-time algorithms based on rational generating functions for es- 
sentially all of the algorithmic questions we have encountered so far. We refer to 
BW| and [DHHHSY] for proofs of various parts of the following theorem. 

Theorem 5.2. Consider a matrix A G Z dxn and a vector c £ 1 d whose dimen- 
sions d and n are fixed. Then the rational generating functions which encode the 
following sets can be computed in time polynomial in the bit complexity of A and c: 

(1) the Grobner basis Ga,c, 

(2) the set Opt^ c of all optimal solutions, 

(3) the set Min(N n \Opt j 4 c ) of minimally non- optimal points, 

(4) the Hilbert basis TIa, 

(5) the Graver basis GTi-A, 

(6) the set of maximally optimal solutions, and 

(7) the integer programming gap gap(^4, c). 

The result (7) about the gap appears in HS2 . The objects in (l)-(6) are 
highly structured subsets of IT 1 . It is this special structure which allows for a short 
encoding. For encoding the Grobner basis Ga,c, the paper DHHHSY uses a 
generating function in 2n variables as in 11.5(1 . But all the sets in (l)-(6) can also 
be coded as formal sums of Laurent monomials (representing vectors in Z n ), and 
the Barvinok- Woods method will give short rational functions for these encodings. 
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A magnificent computer program for solving lattice point problems by means 
of short rational generating functions has been developed by the group of Jesus 
De Loera at UC Davis. It is called LattE and can be obtained at the web site 
http : //www. math. ucdavis . edu/^latte/ This program can be used to count 
the number of feasible solutions to an integer program as follows. 

Consider our coin problem in (|1.2(l with b = ( 5 9 9 9 ) , so we wish to arrange 999 
coins to be worth fifty dollars. In order to determine in how many ways this can 
be accomplished, we create the following LattE input file which we call coins: 

6 5 

999 -1 -1 -1 -1 
5000 -1 -5 -10 -25 

10 

10 

1 

1 
2 12 

The command latte equ coins will count the number of feasible solutions to 
The output which appears on the screen reveals that the answer is 9, 352: 

This is LattE vl.O beta. (September 17, 2002) 
Revised version. (Aug 1, 2003) 

The polytope has 4 vertices. 

Creating generating function. 
Starting final computation. 

**** THE GRAND TOTAL IS: 9352 **** 

Computation done. 
Time: 0.01 sec 

This run of LattE has created the following output on a new file called coins .maple: 

gF : =x [0] ~4999170*x [1] ~ (-8000506) *x [2] * 1000*x [3] ~ 166/ ( ( 1-x [0] ~ (-4995) 
*x [1] ~7993*x [2] ~ (-1) *x [3] ~ (-1) ) * (1-x [0] ~5*x [1] ~ (-9) *x [3] ~ (-1) ) ) + 
x [0] ~4999170*x [1] ~ (-8000506) *x [2] ~ 1000*x [3] " 166/ ( ( 1-x [0] ~ (-9985) 

. . . *x[l]~7993*x[2]~(-l)*x[3]~(-l))*(l-x[0]-(-5)*x[l]-9*x[3])); 

This is the short rational generating function representing the formal sum of 9, 352 
monomials, one for each feasible solution. You can get the expanded form of this 
generating function reading the file coins .maple into the computer algebra system 
maple. After you have done this, please type the maple command simplify (gF) ; . 

The program LattE can also be used to solve the minimization problem 
To this end we need to add the cost vector 10 1 in an extra line at the end 
of the input file coins. Typing now the LattE command sequence ./latte equ 
min coins2, we obtain the following output on the screen 

All cones have been decomposed. 
6 cones in total. 

Computing the points in the Parallelepiped of the unimodular Cones. 
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An optimal solution for [0 10 1] is: [555 2 441 1] . 
The optimal value is: 3. 
The gap is: 3 
Computation done. 

We conclude from this LattE session that the best way of making fifty dollars with 
999 coins is to take one quarter, two nickels, 441 dimes and 555 quarters. 

I tried LattE on considerably bigger problems and I found that it performs 
quite well. The speed is particularly impressive for knapsack problems (d = 1) 
with large integer coefficients. For this class of problems, LattE is faster than the 
current version of the commerical software CPLEX on some instances. This parallels 
the observation, already made in |HS1| . that programs like CPLEX are not always 
the best choice for low-dimensional problems with large integers, given that they 
are designed for highly structured 0/1 problems with many variables. 

The authors of LattE informed me that they intend to incorporate all of the 
tasks listed in Theorem 15.21 into a future version of their program. The lesson to 
be learned here is that algebraic software like 4ti2 and LattE can definitely play a 
useful role in the box of tools available to practitioners of integer programming. 

6. Some integer programs arising in statistics 

We present an application to the statistical theory of disclosure limitation. 
See |CG| and jDFj and the references therein. Suppose we are given data in 
the form of an n-dimensional table of nonnegative integers. The aim is to release 
some marginals of the table but not the table's entries themselves. If the range of 
possible values that a particular entry can attain in any table satisfying the released 
marginals is too narrow then this entry may be exposed. This shows the importance 
of determining tight upper and lower bounds for each entry in a given table. 

A choice of marginals corresponds to fixing subsets Ft, . . . , Fk of {1, ... ,n}. It 
can be represented by a zero-one matrix A, as described in HSu| §1]. In statistical 
language, the matrix A specifies a hierarchical model for a contingency table with n 
factors. Suppose v is a table with nonnegative integer entries, where the marginals 
are computed according to a fixed hierarchical model A and let Vi 1 i 2 ...i n be a par- 
ticular cell of the table v. What we are interested in is the following table entry 
security problem: Compute optimal lower and upper bounds L and U such that 
L < Ui 1 i 2 ...i n < U for all tables u which have the same marginals as v. 

The table entry security problem is an integer program: minimize (or maxi- 
mize) Ui r i 2 ...i n over all tables with nonnegative integer entries subject to fixing the 
marginals. In order to write this integer program in the standard form Ijl.ljl . we need 
to give the precise definition of the relevant matrices A. Consider d\ X • • • X d„-tables 
with entries Ui r i 2 ...i n where 1 < ij < dj. Wc fix a hierarchical model by specifying 
Ft, . . . , Fk- The marginals of our table are computed with respect to these subsets. 
If Fi = {jt, ■ ■ ■ ,j s } then the i^-marginal is a dj t x • • ■ x dj s table b with entries 

(6.1) b kl -k s = u ii-i„- 

We define A to be the zero-one matrix with d\dz ■ ■ • d n columns representing 
the linear map that computes the marginals of tables. We let u be the vector of 
variables representing the cell entries. Then A-u represents the k lower-dimensional 
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tables computed as in (|6.1(l . The table entry security problem is 

(6.2) Minimize (Maximize) U11...1 subject to A ■ u = fo, u > 0, u integral. 

Here we only consider the cell entry W11...1 (corresponding to the first column 
of A) because there is a transitive symmetry group acting on the columns of A. 

Example 6.1. The classical transportation problem |Schl p. 221] corresponds 
to di x G? 2 -tables where the marginals are computed with respect to F\ — {1} and 
F 2 = {2}. The three-dimensional transportation problem Via concerns d\ x d 2 x da- 
tables with Fx = {1, 2}, F 2 = {1, 3}, and F 3 = {2, 3}. The marginals are 

bij = ^ Uijk , hk = 22 Uijk , bjk = ^ Uijk- 

k j i 

For a discussion from the Grobner basis perspective see |Stul §14. C]. □ 

Example 6.2. Consider the four-cycle model for binary random variables. Here 
n = 4, d, = d 2 = d 3 = d 4 = 2, Fx = {1,2}, F 2 = {2,3}, F 3 = {3,4}, and 
P4 = {1,4}. The matrix A has d\d 2 + d 2 d 3 + d 3 d 4 + d\d 4 — 16 rows and it has 
d\d 2 d 3 d± — 16 columns. We write it in 4ti2 format on a file name fourcycle: 
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16 
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The Graver basis of this matrix A consists of 106 vectors. The 4ti2 command 
graver fourcycle delivers the Graver basis on a new file fourcycle . gra: 

16 106 

0000010-11-100-1001 

-101010-10100-1-1001 

0000010-100000-101 

000010-100000-1010 

10-100000-10100000 

00-11001-100000000 

-11001-10000000000 

0-11001-10100-1-1001 
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In light of Proposition 13.21 we can use this Graver basis with the Algorithm (|2.1(l 
to solve for any cost function. In particular, we can use it solve ()6.2|) . □ 

As the parameters d\, . . . , d n increase, it becomes harder to solve the integer 
program il{j.2fl exactly. Researchers in disclosure limitation have resorted to solving 
the linear programming relaxation (|4.1|l instead: minimize (or maximize) lfj 1 i 2 ...i„ 
over all tables with nonnegative real entries subject to fixing the marginals. This 
relaxation is tractable, but it usually fails to deliver the exact integers L and U. 
One faces the problem of finding the integer programming gap for the table entry 
security problem. This application was the original motivation for the paper HS2 . 

Example 6.3. What follows may serve as a test case for future software for 
computing the gap. We consider the -ft^-model for five binary random variables. 
Here ?i = 5, fc = 10, d\ = ■ ■ ■ = d§ = 2 and the Fi are the ten two-element subsets 
of {1,2,3,4,5}. The cost function is c = —emu, corresponding to maximizing in 
(|6.2(l . The matrix A has 40 rows and 32 columns, and it has rank 16. We found 

(6.3) ga P (A,c) = 3. 

The gap is attained by the following 2x2x2x2x 2-table: 

(6.4) u — emi2 + eni2i + 2 • en2n + 2 • ei2in + 2 • e2im + e22222- 

This table is optimal for the maximization problem in (|6.2|) . The 40 entries of the 
right hand side vector b are the entries in the ten marginal 2 x 2-tables: 

/ Ull*** 1tl2***\ [4 2\ / ti***H U***12\ (5 1 

yU21*** 1<22***/ \2 lj ' ' \ V U***21 U***22,/ \2 1 

Since the unit table emu does not appear in (|(>.4|> . we have IPopt^ c (b) = 0. 

The optimal value of the linear programming relaxation equals LPopt^ c (b) = 
3. This value is attained by the following fractional 2x2x2x2x 2-table 

v = 3 • eooooo + f en2ii + \e-\ni\ + femn + \&\iiti + \e-ni\i + \zyiti\ + 

12 12 12 1 

3612222 + 3621112 + 3621121 + 3621211 + 3621222 + 3622m + 3622121- 

We invite the reader to check that the tables u and v have the same marginals. □ 
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